arXiv:1501.00267vl [cs.DS] 1 Jan 2015 


Fast Generation of Random Spanning Trees and the Effective 

Resistance Metric 


Aleksander Madry* * 
EPFL 


Damian Straszakt 
University of Wroclaw 


Jakub Tarnawskb 
University of Wroclaw 


Abstract 

We present a new algorithm for generating a uniformly random spanning tree in an undirected 
graph. Our algorithm samples such a tree in expected 0(m 4 / 3 ) time. This improves over the 
best previously known bound of min(0(?ny / n), 0(n u )) - that follows from the work of Kelner 
and Mydry [FOCS’09] and of Colbourn et al. [J. Algorithms’96] - whenever the input graph is 
sufficiently sparse. 

At a high level, our result stems from carefully exploiting the interplay of random spanning 
trees, random walks, and the notion of effective resistance, as well as from devising a way to 
algorithmically relate these concepts to the combinatorial structure of the graph. This involves, 
in particular, establishing a new connection between the effective resistance metric and the cut 
structure of the underlying graph. 


1 Introduction 

Random spanning trees are among the oldest and most extensively investigated probabilistic objects 
in graph theory (see, e.g., ED)- Their study goes back as far as to the works of Kirchhoff in the 1840s 
|24| . Since then, it has spawned a large number of surprising connections to a variety of topics in 
theoretical computer science, as well as in discrete probability and other areas of mathematics. For 
example, random spanning trees are intimately related to the task of spectral graph sparsification 
mm and Goyal et al. [18] showed how to directly use them to generate an efficient cut sparsifier. 
They are also closely tied to the notion of electrical flows, a powerful primitive that was instrumental 
in some of the recent progress on the maximum flow problem ED S3]. Finally, random spanning 
trees were at the heart of the recent results that broke certain long-standing approximation barriers 
for both the symmetric and the asymmetric version of the Traveling Salesman problem [3(35]. 

It is, therefore, not surprising that the task of developing fast algorithms for generating random 
spanning trees has received considerable attention over the years ( E3 m El EH El (D 0 S3122] ). 
Broadly, these algorithms can be divided into two categories: determinant-based ones and random 
walk-based ones. 

The determinant-based algorithms rely on the celebrated Kirchhoff’s Matrix Tree Theorem 
J24j (also see, e.g., Ch.3, Thm.8 in [?]) that reduces counting of the spanning trees of a graph to 
computing a determinant of a certain associated matrix. This connection was first used in the 
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works of Guenoche [19i| and of Kulkarni m and resulted in 0(mn 3 )-time algorithms. Then, after a 
sequence of improvements nans], this line of research was culminated with the result of Colbourn 
et al. HD that generates a random spanning tree in time 0( n“), where uj < 2.373 is the exponent 
of the fastest algorithm for square matrix multiplication mm- 

On the other hand, the random walk-based approach to random spanning tree generation is 
based on the following striking theorem due to Broder [9] and Aldous [Tj: 

Theorem 1.1 (HUS3) Suppose you simulate a random walk in an undirected graph G = (V,E), 
starting from an arbitrary vertex s and continuing until this walk covers the whole graph, i.e., until 
every vertex has been visited. For each vertex v other than s, let e v be the edge through which v was 
visited for the first time in this walk. Then, T = {e v : v £ V\ {s}} is a uniformly random spanning 
tree of G. 

In light of this theorem, we can focus our attention on simulating a covering random walk in the 
input graph G = (V, E ) and then just reading off our sampled spanning tree out of the first-visit 
edges of that walk. The running time of the resulting procedure is proportional to the cover time 
of the input graph G. If G has n vertices and m edges, it is known that this cover time is always 
0(mn) [Sj. This improves upon the 0(n u ) bound corresponding to the determinant-based approach 
whenever G is sparse enough. 

Unfortunately, there are examples of graphs for which the above 0(mn) bound on the cover 
time is tight. Therefore, it is natural to wonder if one can obtain a random walk-based algorithm 
for random spanning tree generation that runs faster than the cover time. Wilson |44| was the 
first one to show that this is indeed the case. He put forth a certain random process that is a 
bit different than random walk, but related. Using this process one is able to generate a random 
spanning tree in time proportional to the mean hitting time of the graph. This quantity turns out 
to never be much larger than the cover time and is often much smaller. However, its worst-case 
asymptotics is still 0(mn). 

Later on, Kelner and Mqdry [22] showed that this worst-case bound of 0(mn) can nevertheless 
be improved upon. (Initially, this improvement was only for approximate sampling, but later Propp 
m showed how to make it work for the exact case too - see Lemma 7.3.2 in [32].) They observed 
that to use Theorem [13 one does not need to simulate the covering random walk in full. It suffices 
to generate its shortcut transcript which retains enough information to allow us to recover what 
the first-visit edges were. 

Following this observation, they put forth such a shortcutting technique that relied on a simple 
diameter-based graph partitioning primitive of Leighton and Rao [29]. This technique yields a 
transcript which is significantly shorter than the full transcript of the covering walk and can be 
efficiently generated using fast (approximate) Laplacian system solvers JUJ T2J [251, 26[f23]. This 
resulted in a generation procedure with 0(my/n) total (expected) running time, which is the best 
one known for sparse graphs. 

1.1 Our Contribution 

In this paper, we present a new algorithm that makes progress on the sparse-graph regime of fast 
random spanning tree generation. More precisely, we prove the following theorem. 

Theorem 1.2 For any connected graph G with m edges, we can generate a uniformly random 
spanning tree of G in expected 0(m 4 / 3 ) time. 


2 


This improves over the 0(m^/n) bound of Kelner and Mqdry 
sparse. 


whenever the graph is sufficiently 


1.2 Our Approach 

The starting point of our approach is the Kelner-Mqdry algorithm [22] we mentioned above. Recall 
that this algorithm generates a random spanning tree via simulation of a shortcut transcript of the 
covering random walk. That transcript retains enough information to allow us to recover what the 
first-visit edges were while being relatively short and possible to generate quickly. This results in 
an improved 0(rn^/n) running time. 

Unfortunately, their technique has certain limitations. Specifically, there are graphs for which 
the length of the shortcut transcript they aim to construct is inherently fl(n 3//2 ). (See Section [ 3 ] for 
such an example.) This makes the n(n 3 / 2 ) running time a barrier for their approach. 

Our algorithm broadly follows the theme put forth by Kelner-Mqdry. However, to obtain an 
improved running time and, in particular, to overcome this ^(n 3 / 2 ) barrier, we depart from this 
theme in a number of ways. 

The key new element of our approach is making the effective resistance metric, and its connec¬ 
tions to random spanning trees and random walks, the central driver of our algorithm. Compared 
to the traditional graph distance-based optics employed by Kelner and Mqdry, using such an effec¬ 
tive resistance-based view gives us a much tighter grasp on the behavior of random walks. It also 
enables us to tie this behavior to the cut structure of the graph in a new way. 

Specifically, we develop a relationship between the fact that two vertex sets are relatively far 
away from each other in the effective resistance metric and the existence of a small cut separating 
them. (See Lemma 4.1 in Section |4j) It can be seen as a certain minimum s-t cut-based analogue 
of the celebrated Cheeger’s inequality HE! (see also |28j ) and the role it plays in the context of 
the sparsest cut problem. We believe this relationship to be of independent interest. 

Furthermore, to take full advantage of this effective resistance-based view, we also change the 
way we approach the task of generation of a random spanning tree. Namely, instead of focusing on 
efficient generation of a single shortcut transcript and then using it to recover the corresponding 
random spanning tree in full, we generate a number of carefully crafted transcripts, one after 
another. None of these transcripts alone is enough to extract the whole random spanning tree from 
it. More precisely, each one of them provides us only with a sample from the marginal distribution 
corresponding to the intersection of the random spanning tree with some subset of edges of the 
graph. However, by making sure that all these samples are consistent with each other (which is 
achieved by running each consecutive sampling procedure in a version of the graph that incorporates 
the choices made in previous iterations), we can combine all of them to recover the entire random 
spanning tree in the end. 

Finally, to make the above approach algorithmically efficient, we utilize a diverse toolkit of 
modern tools such as Laplacian system solvers, fast approximation algorithms for cut problems, 
and efficient metric embedding results. 


1.3 Organization of the Paper 

We begin the technical part of the paper in Section [2j where we present some preliminaries on the 
tools and notions we will need later. Next, in Section |3j we provide an overview of our algorithm 
and the description of its main components. We develop these components in subsequent sections. 
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Specifically, in Section [4j we describe our graph-cutting procedure and, in particular, establish the 
connection between the effective resistance metric and graph cuts. Then, in Section [5], we conclude 
by providing and analyzing our sampling procedure that generates the random spanning tree. 


2 Preliminaries 

Throughout the paper, G = (V, E) denotes an undirected connected graph with vertex set V and 
edge set E (we allow parallel edges here). Let V',V" C V be two subsets of vertices of G. We 
define E(V', V”) to be the set of all the edges in G whose one endpoint is in V' and the other one 
is in V" . Also, we define the interior E(V') of V' to be the set of all the edges in G with both their 
endpoints in V' , i.e., E(V') := E(V', V'). Further, the boundary dV' of the set V 1 is the set of all 
the edges in G with exactly one endpoint in V’ . We say that an edge e E E is a boundary edge of 
V’ if e £ dV’ . Finally, for a collection C of subsets of V, we say that an edge e G E is a boundary 
edge of C iff e is a boundary edge of some set V' G C. 


2.1 Random Spanning Trees and Random Walks 


Our objective is to sample a random spanning tree TCG from the uniform distribution, that is, 
so that for every possible spanning tree T of G, the probability that we output T is yTg\' w here Tg 
is the set of all spanning trees of G. 


In light of the connection that Theorem 1.1 establishes between random spanning trees and 
random walks, the latter notion will be at the center of our considerations. 

By a random walk on G we mean the stochastic process corresponding to a walk that traverses 
the vertices of the graph by moving, in each step, from the current vertex u to one of its neighbors 
v, chosen uniformly at random, over the edge (it, v ) G E. It is well-known that this stochastic 
process has a stationary distribution described by the formula n(v) = where d(v) is the degree 
of v (see, e.g., [30]). 

Now, by the cover time Cov(G ) of a graph G we mean the expected time it takes for a random 
walk started at a vertex s G V to visit (cover) every vertex of G. (We take a worst-case bound here 
over all the possible choices of s.) 


2.2 Effective Resistances and the Effective Resistance Metric 

Another key notion we will use to establish our result is that of effective resistance and the effective 
resistance metric. This notion arose originally in the context of study of electrical circuits in 
physics, but since then it has found a number of equivalent characterizations that highlight its 
intimate connection to the behavior of random walks and also to random spanning trees themselves 
(see 

HZ1IB1I33I3DI). The characterization that will be especially useful to us was proved in m (see 
also, e.g., [30]). It states that the effective resistance distance R e g(u,v) between two points u and 
v in G is equal to where n(u,v) is the commute time between u and v, i.e., the expected 

number of steps that a random walk started at u takes before visiting v and returning back to 
u. Also, even more interestingly in our context, it is known that for any edge (u,v) of G, the 
probability that this edge is included in a random spanning tree is exactly R e g(u,v). 

Given the above characterization in terms of the commute time, it is not hard to see that the 
distances R e g(u, v ) give rise to a metric - called the effective resistance metric. We may thus define 
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the effective resistance diameter of a subset of vertices V' C V to be 7 e ff (V 7 ) := max^^gy/ i? e ff(x, y). 
We also set 7 e ff(G) := 7eff(E(G)). 


2.3 Low-Dimensional Embedding of the Effective Resistance Metric 

In our algorithm we will often be interested in computing the effective resistance distance between 
pairs of vertices. It is well-known that, for any two vertices u and v, one can obtain a very good 
estimate of the effective resistance R e g(u,v ) in 0(m) time using the fast (approximate) Laplacian 
system solvers HU 03 E3 HH 122]. Unfortunately, as we will be interested in distances between 
many different pairs of points, this running time bound is still prohibitive. However, Spielman and 
Srivastava [3D] designed a way for computing (approximate) effective resistances distances that has 
significantly better per-query performance. 

Theorem 2.1 Let G = (V, E) be a graph with m edges. For every e > 0 we can find in 0(m) time 
a low-dimensional embedding 1Z of the effective resistance metric into logm) that with 

high probability 

Vu,vEV (1 — s)R e g(u, v) < lZ(u, v) < (1 + e)R e g(u, v), (1) 

and each value oflZ(u,v) can be computed in 0(e _2 logm) time. 

We note that the way we employ the above theorem in our algorithm ensures that even when the 
guarantee (jTj) fails, this does not affect the algorithm’s correctness - just the running time. So, as 
this guarantee holds with high probability, we can assume from now on that 0 always holds. 


2.4 Bounding the Cover Time via Effective Resistance 


In Section |2.2[ we have seen the tight connection between effective resistance and the commute 
time of random walks. As it turns out, drawing on this connection we can relate effective resistance 
to another key characteristic of random walks: the cover time. To describe this, let us recall the 
classic results of [2] (see also m ) that the cover time Cov(G) of a (connected) graph with m edges 
can be bounded from above by 0{mn ) as well as by 0(mA(G)), where A (G) is the graph-distance 
diameter. 

It turns out that by working with the effective resistance-based - instead of the more traditional 
graph distance-based - notion of diameter, one is able to obtain a much tighter characterization 
of the cover time by replacing A(G) by 7 e g(G) in the bound. It is tighter, as it is not hard to see 
that the effective resistance distance is always upper-bounded by the graph distance and thus, in 
particular, 7 e ff(G) < A (G) for any G. Moreover, it is also the case that the resulting upper bound 
on the cover time is close to being a lower bound. The following lemma, proved in Appendix A.l 
demonstrates both bounds. It stems from the work of Matthews [53] . (Also, see [20, 1 (>[ for even 
tighter characterizations of the cover time.) 


Lemma 2.2 For any graph G we have m7 e fr(G) < Cov(G) < 0(m / y e s(G) log m). 


In Section 5.4 we generalize the above lemma to deal with cover times of subgraphs. This may 
be of independent interest. 
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2.5 Ball-growing Partitioning 

An important primitive in our algorithm is the following simple but powerful graph partitioning 
scheme based on the ball-growing technique of Leighton and Rao |29j . It is in fact a simplified 
version of the ( 0 , 7 )-decomposition used in [ 22 ] , 

Let us first introduce the following definition. 


Definition 2.3 We say that a partition of a graph G with m edges into disjoint components 
Vi..... 14 is a (4>, 7 )-decomposition of G iff 

(1) the graph distance-based diameter A(V)) of each Vi is at most 7 , 

(2) the total number of boundary edges of V\,... ,Vk is at most cf> ■ m. 


As it turns out 


see Appendix A.2 


we can always find the following ( 0 , 7 )-decomposition fast. 


Lemma 2.4 For every f> = o(l) and every graph G with m edges, we can find a 
decomposition in 0(m + n) time. 


2 log(m+l) 


2.6 Approximating Minimum s-t Cuts 

One of the key parts of our algorithm relies on being able to quickly identify an approximate 
minimum s-t cut in an undirected graph. To do that we use the recent fast (1 + e)-approximate 
maximum s-t flow results of Sherman [39), Kelner et al. |21j and Peng [36]. 

Theorem 2.5 For any graph G, any two vertices s and t, and any e > 0, one can obtain a 
(1 + e )-approximation to the minimum s-t cut problem in G in 0{me ~ 2 ) time. 


3 The Overview of the Algorithm 

The starting point of our algorithm is the random walk-based approach proposed by Theorem 0 
and the Kelner-Maxlry technique [22] of speeding it up. 

Recall that the bottleneck of the vanilla random walk-based approach is that performing a step- 
by-step simulation of a covering random walk can take Q(mn) time. To circumvent this problem, 
Kelner-Mqdry observed that we actually never need to simulate this covering random walk in full. 
We just need to be able to recover all its hrst-visit edges e v . As it turns out, the latter can be 
done more efficiently by considering a certain “shortcut” version of that walk and simulating this 
shortcutting instead. 

Roughly speaking, the design of this Kelner-Mqdry shortcutting boils down to finding a partition 
of the graph G into regions that have diameter at most 7 each while cutting at most a </>-fraction of 
edges, for some choice of parameters 7 and <f>. (One can obtain such a partition using the standard 
ball-growing technique of Leighton and Rao [29].) Then, one starts a step-by-step simulation 
of a covering random walk in G. The crucial difference is that whenever some region is fully 
covered by the walk, one starts shortcutting any subsequent visits to it. More precisely, upon 
future revisits to such an already-covered region, one never simulates the walk inside it anymore, 
but instead immediately jumps out of it according to an appropriate exit distribution. As [ 22 ] 
shows, these exit distributions can be computed (approximately) using fast Laplacian system solvers 
HU H2l EH [261 [23]. Further, the resulting overall (expected) time needed to perform the whole 
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shortcut simulation is at most 0(m (7 + (pm)) = 0 (m 3//2 ) (with the optimal choice of parameters 
being 7 = 0(y/m) and (p = 0(1/y/m)). 

The above technique can be further refined to yield a total running time of 0(mn 1//2 ), which is 
an improvement for non-sparse graphs (see [22] for details). However, H(n 3 / 2 ) time turns out to be 
a barrier for this approach, even in the sparse-graph regime. 
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Figure 1: An example of a graph on which Kelner-Mqdry algorithm runs in Q(nz) time. It consists 
of two expanders G\ and G2, each having Q(n) vertices, connected by y/n disjoint paths of length 
Q(y/n) each. 

To understand why this is the case, consider the graph depicted in Figure [1} This graph has 
an 0 (y/n) diameter. Therefore, if one is interested in generating a random spanning tree only in 
0 (n 3 / 2 ) running time, one can simply simulate its covering random walk faithfully - by a well-known 
result [ 2 ], this walk will have an expected length of 0(n 3 / 2 ). 

However, if one wanted to use the Kelner-Mqdry approach to get an improved running time 
here, one hits a bottleneck. The difficulty is that the graph cannot be partitioned in a way that 
makes the diameter 7 of each piece, as well as the number (pm of edges cut, both be o(y/n). (In 
particular, the two expander-like parts G\ and G2 of this graph are especially problematic due to 
their size and being far away from each other as well as having large boundaries.) So, the running 
time of the Kelner-Mqdry algorithm is always H(m( 7 + (pm)) = H(n 3//2 ) here. 

3.1 Breaking the f2(n 3 / 2 ) Barrier 

Our improved 0(m 4//3 )-time algorithm that, in particular, breaks this H(n 3//2 ) running time barrier, 
stems from a number of new observations. We present them here in the context of the example 
graph depicted in Figure [lj This will serve as an informal motivation for the general techniques 
that we introduce in the remainder of the paper. 

The key observation to make is that random walks are a spectral notion, not a combinatorial 
one. Therefore, basing our analysis on the effective resistance metric - instead of applying the 
traditional graph distance-based optics - turns out to be a better choice. 

In particular, if one considers our graph in Figure [l] from this perspective, one can notice that 
the effective resistance-based geometry of that graph differs from its graph distance-based geometry 
in a very crucial way. Namely, the effective resistance-based diameter of the whole graph is still 
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D(n 1//2 ). (This can be seen by looking at the middle vertices of any two different paths in G .) 
But, the two expander-like parts G\ and G 2 — which were far away from each other in the graph 
distance-based sense - are actually very close from the point of view of effective resistance distance! 
(This difference stems from the fact that the latter notion depends not only on the length of the 
shortest path but also on the number of disjoint paths.) 

As a result, the union G\ U G 2 of these two parts forms a region that can be covered relatively 


efficiently by a random walk. More precisely, as we show in Lemma 5.3, a random walk in G needs 
only a small number of steps inside G\ U G 2 before covering this region in full. 

Now, to turn this observation into a fast routine for random spanning tree generation, we need 
to cope with two issues. Firstly, Lemma 5.3 bounds only the number of steps inside the region 
Gi U G 2 . Secondly, it measures the number of steps needed before the region is covered - the 
number of steps needed to cover the whole graph G might (and actually will) be much larger. 

Dealing with the former issue is relatively straightforward in our example. After all, the outside 
of our region in G is just a collection of paths. So, it is easy to partition it into small sets with 
relatively sparse boundaries and then use the shortcutting technique of Kelner-Mc|,dry to simulate 
the random walk over these sets efficiently. 

To deal with the latter issue, we abandon the original desire of sampling the whole random 
spanning tree “in one shot”, i.e., from a trajectory of a single covering random walk. Instead, we 
first sample only a part of the random spanning tree: its intersection with our region Gi U G 2 . This 
corresponds to simulating a random walk that covers G\ U G 2 (but not necessarily the whole graph 
G ), and we already know how to do this fast. 

Once this intersection of our region and the random spanning tree is sampled, one can simply 
encode this sample in G by contracting the edges of our region that were chosen and removing the 
remaining ones. As G\ U G 2 constituted a significant fraction of the edges of G (and was also the 
most troublesome structure in G), this sampling makes large progress and thus allows us to simply 
recurse on the updated (and smaller) version of G. 

Now, even though the above discussion was focused on our example graph, it actually already 
covers almost all the crucial insights behind our general algorithm. 

In particular, at a high level, the main goal of our algorithm is exactly to try to identify in 
the input graph G some large region that, on one hand, has a small effective resistance diameter 
and, on the other hand, has an exterior “shatterable” enough to enable efficient Kelner-M^dry-type 
shortcutting. Once such a region is identified, we can make sufficient progress by sampling from its 
intersection with the random spanning tree, as described above. (Observe that one of the benefits 
of this approach is that we never need to shortcut the random walk over this region.) 

At this point, the only major missing element is a method of dealing with situations when such 
a large small-radius region with “shatterable” exterior does not exist. (For instance, such a situation 
arises when we remove all but one path connecting G\ and G 2 in our example graphQ) To cope 
with such cases, we establish a relationship between the effective resistance distance and the cut 
structure of the graph. Specifically, we prove that in such cases there has to exist a small cut that 
separates two large parts of the graph. (See Lemma 4.1 in Section [4] for details.) Therefore, in lieu 
of advancing our sampling, we can make progress by refining our grasp on the cut structure of the 
input graph. 


Tn this case, Gi and G2 are no longer close in the effective resistance distance, and neither Gi nor G2 can be 
partitioned into small sets with sparse boundary. 






In the sequel, we proceed to making the above ideas rigorous and introducing the key technical 
notions behind our algorithm. 

3.2 Subdivisions, Overlays, and Covering Families 

The graph-theoretic notions that will be key to our algorithm are subdivisions and covering families. 
A subdivision C is any collection of disjoint subsets of vertices. Then, a covering family C = 
(Co,..., C() is any collection of I + 1 subdivisions Co,..., Ci - where I \= \ logm - such that (1) 
Co = {F}; and (2) for each 0 < i < £, and any set C C V in Cj, the total number of edges in the 
interior E(C) of C is at most m/2*, i.e., for each 0 < i < £ and all C G Cj, 

771 

\E(Q\ < (2) 

where m is the number of edges of our input graph G. (Condition ( 1 ) in the above definition is just 
a convention that we employ to ensure that each vertex always belongs to at least one subdivision 
in the covering family. We also assume I to be integral for simplicity.) It is important to observe 
here that even though sets contained in the same subdivision Cj need to be disjoint, sets belonging 
to different subdivisions can (and often will) overlap. 

Now, the key reason for our use of covering families is that, even though they do not directly 
constitute a partition of the graph G, they still provide an implicit - and much more convenient 
to deal with from the computational point of view - representation of certain partitions of G. 
These partitions will play an important role in our algorithm. To define them formally, consider a 
covering family C = (Co, ... ,Cf). Then, the overlay V(C) of C is the partition of G resulting from 
superimposing all the subdivisions Cj of C on top of each other. That is, V(C) is a partition of G 
in which, for any two vertices v and u of G, these vertices are in the same component of V(C) iff 
they are not separated by any set in C, i.e., for every set C in each subdivision Cj of C, it is the case 
that either u and v both belong to C or both are not a part of C. 

3.3 Shattering and ct-bounded Covering Families 

Our goal will be to obtain covering families C that, on one hand, have their overlay V(C) partition 
the graph G into components of relatively small effective resistance diameter and, on the other 
hand, have a relatively small number of boundary edges in each of their subdivisions. 

To make this precise, let us fix some covering family C = (Co,... ,Cf). We will call C shattering 
iff all the components in its overlay V(C) have effective resistance diameter at most 7 * := 56m 1//3 . 
Also, let us call C a-bounded , for some a > 0, iff for each 0 < i < £, the number of Abased boundary 
edges in C is at most am 1 / 3 2 *. Here, an edge e is an i-based boundary edge of C, for some 0 < i < £, 
iff e belongs to the boundary dC of some set C G Cj, but is not part of the boundary of any set C' 
in Cj/ with i' < i. 

Before proceeding further, let us observe in passing that the fact that any boundary edge in Cj 
has to be an i'-based boundary edge for some i! < i makes the a-boundedness property also give us 
a bound on the total number of all boundary edges of each subdivision Cj. Namely, the following 
simple fact holds. 

Fact 3.1 If C = (Co, ... ,C() is an a-bounded covering family then, for each 0 < i < I, the total 
number of boundary edges of the subdivision Ci is at most 2am 1 // 3 2*. Furthermore, the total number 
of all boundary edges in the whole C is at most 2am 1 ^ 3 2 £ = 0(am 2//3 ). 
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Now, in the language of the above definitions, our interest will be in finding covering families 
that are shattering and a-bounded for some a being polylogarithmic in m^\ As we show in Section 
[4j such covering families can indeed be found efficiently. In particular, we prove there the following 
lemma. 


Lemma 3.2 For any a > 0 and any a-bounded covering family C = (Co,... ,Ci), we can, in 
0(m 4//3 ) time, extend this C - by only adding new sets to its subdivisions or subdividing the existing 
ones - to a covering family C = (C' 0 ,... ,C' e ) that is shattering and a'-bounded with a' = a + a* 
and a* being a fixed parameter that is polylogarithmic in m. 


Note that the claimed existence of a log 0 *' 1 ) m-bounded and shattering covering family follows 
directly from this lemma once one takes C to be a trivial 0-bounded covering family with Co = {H} 
and all other Cj being empty. However, as we will see shortly, our algorithm will actually need the 
above, more general, statement. 

To try to motivate the definition of a covering family and its a-boundedness, let us get a little 
ahead and say that each boundary edge of Cj will incur a preprocessing time cost proportional to 
the maximum size of a set in Cj (cf. Lemma 5.1). For an a-bounded covering family C, the sum of 
these quantities is bounded by am 1 / 3 2* • ~ = am 4 / 3 for every i. Thus, we will be able to support 
our running time bound by maintaining the trade-off between sizes of sets in Cj and the number of 
boundary edges of Cj. 


3.4 F-Marginals, (F, F')-Conditioned Graphs, and Minimum-Age Interiors 

As we mentioned earlier, our algorithm, instead of trying to sample the whole random spanning 
tree at once, will perform a sequence of samplings from certain marginal distributions. Namely, for 
a given subset of edges F C E of the graph G, we will be interested in sampling from the marginal 
distribution - that we will call the F-marginal of G - describing the intersection of the random 
spanning tree with the set F. 

Observe that the connection between random walks and sampling random spanning trees pro¬ 
vided by Theorem 0 can be straightforwardly extended to give a sampling procedure for any 
T-marginal. We can just simulate a covering random walk X in G, starting from an arbitrary 
vertex s, and take as our sample the intersection Ep := {e v : v 6 V \ {s}} n F of F and all the 
first-visit edges e v to all the vertices of G other than s in that walk. 

Now, the key observation to make here is that to sample from the T-marginal of G, not only 
do we not need to simulate such a covering random walk X in full, but also we do not even need 
to recover all the first-visit edges e v . All we need is just to be able to extract all the edges e v 
that could belong to Ep. So, as trivially Ep C F, we do not ever have to keep exact track of the 
movement of the walk X over edges which have no common endpoint with an edge of F. 

It turns out that having such ability to choose some F and then mostly ignore the parts of 
the random walks that do not touch F is very convenient and our algorithm takes advantage of 
that in a crucial way. Namely, as we show below, in any graph, there is always a choice of an 
appropriate set F so that sampling from the corresponding T-marginal can be performed efficiently 
and it non-trivially advances our progress on recovering the whole random spanning tree. 

J The exact value of a is irrelevant for us - we treat it as a factor hidden under O(-). 
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To formalize this, let us consider some covering family C = (Cq, ... ,Ce) and let V(C) be the 
corresponding overlay. Given some component P € V(C) of that overlay, we say that P has age 
equal to r, for some 0 < r < i, - we will denote this by age(P) = r - iff 

• P is a contained in some set C - that we will call the age witness W(P) of P - from the 
subdivision C r of C; 

• and P is not contained in any set C l from a subdivision C r ' with r' > r. 

(Note that due to the definition of the overlay it must be that each component P of that overlay can 
only be either fully contained in or completely disjoint from each set C in the covering family.) Also, 
observe that, as in our convention each covering family has Co = {1C}, the age of all components of 
the overlay is well-defined. 

Finally, given a covering family C, we define its age r* to be the smallest age of all the components 
of its overlay V(C), and its minimum-age interior F* to be the union of interiors E(P ) of all the 
components of its overlay V(C) that have such minimum age r*. 


3.5 Efficient Sampling from Minimum-Age Interiors 

Now, one of the crucial properties of the minimum-age interior F* of a covering family C is that 
as long as C is shattering and log°^ m-bounded, we can efficiently sample from the corresponding 
E^-marginal. More specifically, in Section [5] we establish the following lemma. 

Lemma 3.3 Given any covering family C = (Cq, ... ,Ce) that is shattering and a-bounded, for 
some a being polylogarithmic in m, let F* be its minimum-age interior. We can sample from the 
corresponding F*-marginal in (expected) 0(m 4 / 3 ) time. 


Once we have obtained a sample F' C F* from our E*-marginal, as in the lemma above, it is 
important to ensure that this sample remains consistent with all the subsequent samplings that 
we perform. A simple way to achieve this is by encoding this random choice of F' into the graph 
G. Namely, given the graph G, the subset F* C E of its edges and the sample F' C F* from the 
corresponding E^-marginal, let us define the (F *, F')-conditioned graph G' to be G after we contract 
in it all the edges in F' and remove all the edges in F* \F'. It is not hard to see that sampling a 
random spanning tree of G that is consistent with our sample F' , i.e., finding a random spanning 
tree T conditioned on the fact that the intersection of T and F* is exactly F', boils down to finding 
a random spanning tree in the (F*, E'j-conditioned graph G'. So, to ensure consistency across all 
our samplings from consecutive marginal distributions, we just need to make sure to always work 
with the corresponding conditioned graph. And it is easy to keep track of all edge contractions so 
as to be able to recover the whole random spanning tree in the original graph at the end. 

The key consequence of sampling from the E^-marginal, with F* being the minimum-age inte¬ 
rior, is that it allows us to make non-trivial progress, as measured by the structure of the covering 
family that we maintain. That is, either the age of that covering family increases, or the interiors 
of all the components of its overlay partitioning become empty (which by Fact 3.1 means that there 
are few edges left overall). This is made precise in the following lemma, whose proof appears in 


Appendix A.3 


Lemma 3.4 Let G be a graph and let C = (Co, ... ,C{) be an age-r* covering family in G that 
is a-bounded, with its minimum-age interior being F*. Given a sample F' C F* from the F*- 
marginal, we can construct, in 0(m) time, the corresponding (F *, F')-conditioned graph G', as well 
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as a covering family C = (Cq, ... ,C'f) in that graph. Furthermore, this C' will be a-bounded and, if 
r* < t, then the age of C will be at least r* + 1; otherwise, i.e., if r* = l, then each edge of G' will 
be in the boundary dC of some set C in C'. 

An important technical convention in this context is that the value of m that we use in the definitions 
of covering family (see ([ 2 ])) and of a-boundedness always refers to the number of edges of the original 
input graph, i.e., the one before we start removing and contracting edges. (This, of course, means 
that the running times that we cite in all lemmas are also always proportional to the number of 
edges of original graph, not the conditioned graph we might be applying them to.) 


3.6 Our Algorithm 


Having described above all the key ideas and tools behind our approach, we are finally ready to 
present our algorithm. In fact, at this point, it boils down to a simple iterative procedure. 

Namely, our algorithm maintains a graph G and an a-bounded covering family C in that graph, 
for some a being polylogarithmic in m. Initially, G is the original input graph and C is the trivial 
0-bounded and age-0 covering family with Co = {H} and all other C, being empty. 

In each iteration, given G and C, we first apply Lemma 3.2 to C to obtain a covering family C 1 


that is shattering and a'-bounded with a! = a + a*. Next, we use Lemma 3.3 to obtain a sample 
F' from the T-marginal corresponding to the minimum-age interior F of C 1 and then employ the 


procedure from Lemma 3.4 (with C = C 1 ) to obtain the corresponding a'-bounded covering family 
C' in the (F, F^-conditioned graph G'. (Note that even though C 1 was shattering, this C might 
no longer be such, as the removal of edges during construction of G' can significantly increase the 
effective resistance distances.) 

If it is not the case at this point that all the edges of G' are boundary edges of some set in C', 
we just proceed to the next iteration with C being equal to C' and G equal to G'. 

Otherwise, i.e., if indeed all the edges of G' are boundary edges for some set in C , we can 
stop our algorithm and note that this condition together with the a-boundedness of C and Fact 


3.1 


implies that G' has at most 0[am 2 / 3 ) = 0(m 2//3 ) edges. (Recall that in our convention rn 
denotes the number of edges of the original input graph.) So, this graph is small enough that we 
can afford to just simulate the whole covering random walk in it and use Theorem 0 to recover 
the corresponding random spanning tree. (Observe that the expected length of this covering walk 


can be easily bounded in terms of the square of the number of edges of our graph - see Section 2.4 
- which results in a bound of 0(m 2//3 • m 2 / 3 ) = 0(m 4 / 3 ).) 

By combining the edges of this tree with the edges contracted during our repeated constructions 
of conditioned graphs, we obtain the desired random spanning tree in the original graph. It is not 
hard to see that the above finishing procedure can be implemented in total 0(m 4 / 3 ) (expected) 
time, as needed. 

Now, to analyze the iterative part of our algorithm, observe that, by Lemma 3.2 
operations used to obtain C 1 


the only 

from C are adding new sets and subdividing the existing ones. As 


these operations cannot decrease the age of any component in the overlay (but could increase it), 
we can conclude that the age of C 1 is at least the age of C. As a result, Lemma 3.4 ensures that 
with each iteration of our algorithm, the age of the maintained covering family C increases by at 
least one. 

Therefore, there can be at most l+l = O(logm) such iterations before the algorithm terminates. 
This observation implies that, on one hand, our maintained covering family C is indeed always a- 
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bounded for some a being polylogarithmic in m. (This follows since a is initially 0 and then, by 
Lemma 3.2 each iteration increases it by an additive term of at most a* that is polylogarithmic.) 


On the other hand, this observation allows us to conclude that by Lemmas 3.2, 3.3 and 3.4, the 
total (expected) running time of our algorithm is at most 0(m 4 / 3 ). 

In light of all the above, Theorem 1.2 follows. 


4 Cutting the Graph 


In this section, we prove Lemma |3.2| That is, we show how to expand a given cn-bounded covering 
family C of the graph G to a covering family C that is still aAbounded for a! that is not much bigger 
than a and, furthermore, is shattering, i.e., has its overlay V(C') partition G into components of 
small effective resistance diameter. (Consult Section [3] for necessary definitions.) 

At a very high level, our approach is a simple iterative procedure that, as long as C is not yet 
shattering, considers one-by-one all the components P of the current overlay partition V(C) whose 
effective resistance diameter may still be too large and applies a fixing routine to them. This fixing 
routine boils down to first identifying good cuts in G that divide the component P and then adding 
to C carefully-crafted sets (or subdividing the existing ones) based on these identified cuts. (This 
results in subdivision of P in the resulting overlay V(C) and thus in reduction of its diameter.) 

Roughly speaking, the way such good cuts for P are identified is via a combination of two 
partitioning techniques: one purely combinatorial and the other one relying on a certain new 
connection between the effective resistance metric and graph cut structure. More precisely, in our 
fixing routine, we first apply the standard ball-growing primitive, as described in Section [2.5[ to P. 
If this results in a partition of P into pieces that have sufficiently small diameter and size (while 
not cutting too many edges), we use the resulting sets to update C accordingly. On the other hand, 
if that fails, we show that it must be that P contains two large parts that are far away from each 
other in effective resistance metric. As we prove then, these parts need to have a relatively small 


cut separating them in the graph G (see Lemma 4.1 below) and we (approximately) recover this 
cut using the recent close-to-linear-time minimum s-t cut algorithms - see Section [2.6| Based on 
that cut, we update C in a very careful manner. 

We proceed now to describing our procedure in more detail. 


4.1 The Graph-Cutting Procedure 


Let us first recall the setting of Lemma 3.2 more formally. We are given an a-bounded covering 
family C = (Co,... , Q) and we want to expand it so that it would become shattering and still be 
(a + a*)-bounded with a* = log°^ rn. In this expanding, we are only allowed to add new sets to 
C and to subdivide the existing ones. 

To facilitate our analysis, we also maintain an additional constraint during our procedure. 
Namely, for each such subdivision C* with 0 < i < £, we will only be adding new sets - never 
subdividing them. (So, subdivision of existing sets can happen only in the subdivision Cg.) Also, 
we will never modify subdivision Co- 

Before we start our procedure, we construct, in nearly-linear time, the (1 + e)-approximate 
low-dimensional embedding 1Z of the effective resistance metric of G - as described in Theorem |2.1| 
in Section 2.3 - with e = This way, we will be always able to compute quickly (i.e., in O(logm) 
time), for any two vertices u and v in G, an approximation lZ(u,v) of the effective resistance 
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distance R e g(u,v) between them such that 


-TZ(u,v) < Res(u,v) < 27 Z(u,v). 

O 


(3) 


Now, our procedure operates by iterating over all the yet-unseen vertices u of G. (Initially, all 
vertices are unseen.) For each such u, we identify the component P £ V(C) in the overlay V(C) 
of the current C that contains u and run a cutting procedure - denoted by Cut(P) - on it. The 
objective of this procedure is to subdivide P (by adding new sets to C, or subdividing existing ones, 
in order to refine V(C)) into smaller components, all of effective resistance diameter 0{m 1 / 3 ). At 
the end of Cut(P), we mark off all vertices of P as seen. We now describe this procedure. 

To this end, let u s fix some P £ V( C). We start by applying the ball-growing primitive — as 

to it to obtain a (O , m 1 / 3 ^decomposition of P 


described by Lemma 


2.4 


in Section 


2.5 


into pieces of effective resistance diameter at most m 1 / 3 . (Recall that the graph-diameter distance 
always upper-bounds the effective resistance diameter - see Section 2.4 ) For a given piece Pi in 
this decomposition, we call it large if its interior \E(Pi)\ is of size greater than m 2 / 3 ; otherwise, we 
call it small. Let P\,... ,Pk (possibly k = 0) be all the large pieces and let L = (jjLi -f* be their 
union. 

We now choose an arbitrary vertex Vi £ Pi, for each such large Pi, and then use our low¬ 
dimensional embedding 1Z to (approximately) compute the effective resistance distances between 
v\ and all the other u,s. (This can be done in O(klogm) = 0(\P\ + \E(P)\) time.) 

If it turns out that at least one of these distances is larger than ^m 1 / 3 , then we say that the 
ball-growing procedure failed and proceed to our alternative, more sophisticated, cutting method 
- described in Section 4.2 below - that deals with this case. This method encapsulates the key 


advantage of our choice to employ the effective resistance-based optics instead of the usual graph 
distance-based one. (Intuitively, the failure of the ball-growing procedure corresponds to the graph 
structures - such as the graph in Figure [I] - that give rise to the fl(n 3 / 2 ) barrier faced by the 
Kelner-Mcglry method [22].) 

On the other hand, if ball-growing procedure does not fail, i.e., all these distances lZ{y\,Vi) are 
at most i^m 1 / 3 , we are able to deal with the component P relatively easily. To see that, observe first 
that in this case all the large pieces have to be within a ball of small effective resistance diameter, 
i.e., we can bound the effective resistance distance R e g(u,v) between any two vertices v' £ P# and 
v" £ Pi" as 


27 

Ref[(v',v") < dist(v',Vi/) + R e s(vii,vi) + R e f[(vi,Vi/r)+dist(vi",v") < 4- — m 1 / 3 + 2-m 1 / 3 = 56m 1 / 3 , 

2 (4) 

where we used ([3]) and the fact that each Pi has a diameter of at most 7 e ff(-Pj) < A (Pi) < m 1 / 3 . In 
other words, 7 e ff(L) < 56m 1 / 3 . 

Then, we update the covering family C by just adding all the pieces that are small (not large!) 
to the subdivision Cg. By definition, all these small pieces are of size at most m 2 / 3 = p, as needed 
- cf. ([2]). Also, note that if age(P) = ^ and thus P is already contained in some existing C £ Cg, 
adding these pieces is not valid as they would be contained in C. In this case, we just modify C by 
removing all the pieces from it - effectively, this corresponds to subdividing C and thus is a valid 
operation. 

Finally, we note that after the above update, the component P is split into components that 
correspond to each of the small pieces and the union L of all the large pieces. Each of these 
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components - in particular, L — have effective resistance diameter of at most 56m 1 / 3 = 7 * (see 0 ). 
So they satisfy the diameter condition needed in the definition of a shattering covering family - 
see Section [3 .3| - and thus we can mark all the vertices of P as seen and our cutting procedure can 
proceed to the next unseen vertex. 


4.2 Cutting the Component P when Ball-growing Fails 

We describe now how to deal with the case when ball-growing fails, i.e., if we have that for at least 
one large piece Pi with 2 < i < k that was produced by running the ball-growing primitive on P, 
lZ(vi,Vi) > Let us assume wlog that i = 2. 

At this point, we can forget about all the other large and small pieces that our ball-growing 
procedure found in P - from now on we focus exclusively on the large pieces P\ and P 2 . The first 
observation to make in this situation is that the fact that P(ui,u 2 ) > ^m 1 / 3 implies that P\ and 
P 2 are relatively far from each other in the effective resistance metric. In particular, by ([3]), the 
effective resistance distance _R e ff(t>i, U 2 ) between v\ and V 2 can be lower-bounded as 

u 2 ) > ^ • jjm 1/3 > m 1/3 + A(P 1 ) + A(P 2 ) > m 1/3 + 7 e ff(Pi) + 7 eff (F 2 ), (5) 

where we used the fact that the diameter of P\ and P 2 is at most m 1 / 3 . 

Now, the crucial insight here - and, in fact, one of the key new ideas behind our improvement - is 
that distances in the effective resistance metric between vertices are related to the sizes of minimum 
cut separating them. In particular, as we show in the lemma below, one can relate the fact that 
Pi and P 2 are far away in the effective resistance metric to existence of a small cut separating 
these sets in G. (This connection can be viewed as a certain analogue - for the minimum s-t cut 
problem - of the role Cheeger’s inequality 0E1 and its higher-eigenvalue generalization [28] play 
in the context of the sparsest cut problem. In both cases, one is tying certain algebraic properties 
of the Laplacian - in the case of Cheeger’s inequality, these are its eigenvalues, here it is its full 
(pseudo-)inverse - to the quality of the corresponding cut in the graph.) 


Lemma 4.1 Let U,W C V be disjoint sets in a graph G having at most m edges and let u (resp., 
w) be a vertex in U (resp., in W). There exists a cut K* in G that separates U and W and has 
size at most _ 


I K*\ < 


m. 


7 u, w {u,wy 


where we define 7 u ,w{U, W ) := |P e ff(^, w) — 7 e s{U) — 7 e ff(kL) and assume it to be strictly positive. 


Proof: Let us define k := |_y -— jjj\\ 7 + lj- We claim in the lemma that there is a cut of size 

at most k — 1 that separates U from W in G. Assume for the sake of contradiction that there is no 
such cut of size less than k. This means that in a graph G' which is created from G by contracting 
U into a vertex u! and W into a vertex w’ there exists no u'-w’- cut of that size too. As a result, 
there exist k edge-disjoint paths between v! and w' in G'. Once we lift these paths to G, we will 
obtain k paths pi, ... ,pk, with each pi connecting a vertex Ui G U to a vertex Wi € W and being 
of some length dj. 

Our goal is to derive a contradiction by showing that ^R e s(u, w ) < 7 e ff(L r ) + +7eff(!F). (Note 

that once we use the definition of k and rearrange the terms, this indeed will be a contradiction.) 
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To this end, we will use an alternative characterization of effective resistance. According to this 
characterization, for any two vertices v and v ', the effective resistance distance R e s(v,v') between 
these vertices is equal to the energy £(f*) of the minimum-energy flow f* that sends one unit of 
flow from v to v' in G. Here, the energy £(/) of a flow / is equal to ]P e fe an d corresponds to G 
being treated as an electric circuit with all edges being unit resistors. (See, e.g., [31, 8] for a proof 
of this equivalence.) 

In light of the above characterization, we can prove upper bounds on R e s(u,w) by bounding 
the energy of some (not necessarily optimal) flow / that pushes one unit of flow from u to w in 
G. To specify the flow / that we want to use for this purpose, let us first define, for every i, ff 
to be a minimum-energy unit flow from u to tq. Note that the energy £(ff) of ff can be at most 
7 es{U) - this is so as, by the above characterization, 7 e ff (U) is the upper bound on the energy of a 
minimum-energy unit flow between any two vertices in U. Similarly, let ff ! be a minimum-energy 
unit flow from u>i to w — its energy is at most 7 e ff(W). Finally, let ff be the unit flow from m to 
Wi that simply pushes the whole flow via the path p t (of length df). Observe that, for each i, we 
have £{ff) = di . 

Now, our desired flow / is defined as the following convex combination: 

/==££ (/“ + /* + /?)■ 

i 

Note that each sum of the flows ff, ff and ff 1 gives rise to a unit flow from u to w. As a result, 
their convex combination / is also such a unit flow, as desired. 

To bound the energy of /, we observe that by virtue of edge-disjointness of the paths pt we have 



i 


1 \ -\ TYl 

- F’ 


( 6 ) 


where we also used the fact that di < m, as G has at most m edges. 

Next, we note that the energy is a sum of quadratic functions and thus Jensen’s inequality 
implies that for any set of flows f t and parameters a* > 0 we have 


£ 




Using this, our energy-based characterization of effective resistance and the bound ([6]), we obtain 


\rMu, w) <le(f) = E %f" + E + E 

V i i i / 

s £ (E(/“)+ £ (E^)+ £ (El/r) 

TTl 

< 7 es(U) + p- + 7eff(bF), 


16 


which is the desired contradiction. 


□ 


Once we have established the above lemma, we can apply it in our context and notice that, by 
<(5j) , j v1 ,v 2 (Pi,P 2 ) > m 1 / 3 , which gives us that there exists a cut K* in G that separates Pi from 
P 2 and is of size at most m 1//3 . As a result, by contracting P\ and P 2 into single vertices v\ and V 2 
and then using the close-to-linear time approximate minimum s-t cut algorithms - see Section 2.6 


- on the resulting graph, we can recover, in total 0(m) time, such a cut I\ that separates Pi from 
P 2 in G and is still of size at most 0(m 1//3 ). (It is worth noting here that the cut K is a global 
cut in G, i.e., it is not contained in the component P that we are processing and, in fact, it can 
intersect all the sets in our covering family C.) 

Our goal now is to use this cut I\ to carefully craft a new set S to be added to one of the 
subdivisions C* for i < £. Adding this set will ensure that Pi and P 2 become separated in the 
resulting new overlay V(C). To achieve this, we need to ensure, in particular, that, on one hand, 
this set does not intersect any existing sets in C% and, on the other hand, that it does not contain 
too many boundary edges. As the following lemma (proved in Appendix |A.4[ ) shows, such a set S 
can indeed be found and, additionally, it satisfies all the other constraints that we imposed on our 
algorithm. 


Lemma 4.2 Let C = (Co, ..., Cf) be a covering family of a graph G, and let P be a component of its 
overlay partition V{C). Also, let P\ and P 2 be two subsets of P satisfying |P(Pi)|, |P(P 2 )| > m 2 / 3 . 
Given a cut-set of edges K which separates P\ and P 2 in G, we can, in 0(m ) time, produce a set 
of vertices S and an i such that: 

(1) 0 <i<t, 

(2) the size of the interior of S satisfies Aqrr < |P(S')| < fL 

(3) S is disjoint from all sets in subdivision Ci, 

(4) exactly one of the sets Pi, P 2 is contained in S, and the other one is disjoint from S, 

(5) every boundary edge e £ dS is either a boundary edge of Cj with j < i or it comes from the cut 
K, i.e., e £ K. 


At this point, we can use Lemma 4.2 above to produce the set S that we then add to the 
subdivision Cj with i < £ (this is valid thanks to conditions ([3]) and ([ 2 ])). As a result of this update, 
in the new overlay P(C), the component P has been separated into two components P' = P n S 
and P" = P \ S (see condition Q in the Lemma 4.2), both of which might still have too high 
effective resistance diameter. For this reason, we still need to recurse on both these pieces, i.e., we 
call Cut(P') and Cut{P"). As we will shortly see, even though our whole cutting procedure that 
added this new set S was computationally very expensive, i.e., it took 0(m ) time, which might be 
very large compared to the size of P, we can justify this cost by proving that we do not need to 
run this procedure too often. (Roughly speaking, this stems from an observation that the sparsity 
of the cut that S constitutes is relatively small and thus an appropriate charging argument can be 
applied.) 


4.3 Analysis of the Cutting Procedure 

We now proceed to the analysis which shows that our cutting procedure fulfills all its promises. 

Observe that, from the point of view of the running time analysis, our main concern is that 
whenever ball-growing fails, the runtime of the cutting procedure handling this situation is as large 
as 0(m). Fortunately, as the following lemma shows, this does not happen too often. 
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Lemma 4.3 The number of times a new set is inserted into Ci is less than 2* +1 for i < l. 


To see how this provides a bound on the total running time of the cutting procedure from 


Section 4.2, recall that these expensive cut computations are always followed by an insertion of a 
new set into C* with i < i. By the above lemma, summing over all 0 < i < £, we have a total of at 
most = 0(m 1//3 ) of such insertions. The resulting running time bound is 0(?n 4 / 3 ), as desired. 


Proof: [of Lemma |4.3| Note that a new set S may be inserted into Ci with i < t only when ball¬ 
growing fails, i.e., only during the cutting procedure described above in Section 4.2 By condition ([2]) 
of Lemma 4.2, the size of S satisfies < |-E(5)|. Moreover, by condition ([3]), it is disjoint from all 
other sets in Ci, thus any edge of G can be covered by a set from Ci only once. So, as the insertion 
of one set S covers more than new edges, there must be less than 2 l+1 such insertions. □ 

The rest of the running time analysis and the adequacy of our cutting procedure is summarized 
in the following two statements, whose proofs may be found in Appendices |A.5| and |A.6[ respectively. 

Lemma 4.4 Our whole cutting procedure runs in 0(m 4 / 3 ) time. 

Lemma 4.5 The new covering family C is (a + a*)-bounded with a* = log 0 ^ 1 ^ m. 

We are now in a better position to understand the idea behind the notion of a-bounded covering 
families. Namely, if we have a set C that we want to insert into our covering family C, we should 
strive to do so at the lowest level possible (i.e., the lowest level i where the condition |_E(C')| < p 
is satisfied). By doing so, we will guarantee the a-boundedness of the resulting covering family (as 
above), which encodes the trade-off between the number of boundary edges on any level i and sizes 


of sets on that level (see the discussion in Section 3.3). For instance, if we have a large set C, we 


should place it on a low level i of the covering family; this, through the a-boundedness property, 
guarantees that there will be few boundary edges on that level (and such edges are expensive, as 
each one incurs a preprocessing time cost proportional to the maximum size of sets in C, - which is 
roughly the size of C). On the other hand, we should place small sets on high levels of the family; 
because they are small, we can afford to have many more boundary edges incident to them. In this 
way, one can view the 0(logm)-level covering family as a way of making sure that we have a right 
level i on which to insert any set C that we might want to use for shortcutting our random walk. 


5 Fast Sampling from Marginal Distributions 


In this section, we prove Lemma 3.3 That is, we show how, given a covering family C = (Co,.. .Cf) 


in a graph G that is shattering and a-bounded, for some a being polylogarithmic in m, one can 
sample from the minimum-age interior F* of C in (expected) 0(m 4 / 3 ) time. (See Section [ 3 ] for all 
the necessary definitions, and recall that in our convention m refers to the number of edges of our 
original input graph and thus our graph G here can have potentially much less than m edges.) 

As the covering family C that we will be dealing here is fixed, let us simply use V to denote the 
overlay partition V(C) of C and fix r* to be the age of C, i.e., the age age(P) of a minimum-age 
component P in V. Also, let us denote by V* the collection of all such minimum-age components 
P of V, and make V* be the union of the vertices of all the components in V*. 
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Our goal is to sample from the i ? *-marginal distribution of G (cf. Section 3.4). This means that 
we want to sample a random subset of F* according to the distribution that returns F C F* with 
probability 

\{TeTc :THF* =F}\/\Tg\, 


where 7 g denotes the set of all spanning trees of G. In other words, we want to find the intersection 
of F* with a random spanning tree. 

As we already discussed, in light of Theorem this could be done by, first, simulating the 
random walk starting at some vertex s until all the first-visit edges e v to v E V* \ {s} are known 
and, then, returning the intersection F' of F* and all the edges e v found. 

Unfortunately, even though we do not need to learn here what the first-visit edges e v are for 
all vertices of G — only the ones that could be in F *, the expected length of the necessary random 
walk might still be comparable to the cover time of the whole graph. Therefore, we cannot afford 
to simply execute this simulation. 


5.1 Shortcutting 

To cope with the above-mentioned problem and obtain the desired efficient sampling procedure, 
we build on the approach developed by Kelner and Mqdry |22] to sample from the whole random 
spanning tree distribution. 

Namely, the key observation to make here is the following: even though the transcript of the 
whole walk defining the first-edges e v that we need, might be quite long, i.e., even fl(mra), in the 
end, we only utilize at most |U*| < n of its entries. Therefore, one might hope that there is a 
way of directly generating a shortcut transcript of that walk, that is much shorter (and, hopefully, 
possible to generate much more efficiently) while still retaining the information we need. 

To get an intuition how that shortcutting could be implemented, let us consider some subset 
S of vertices of G. (For now, one can think that S corresponds to one of the components of V*.) 
Imagine that we are faithfully simulating the random walk in G step-by-step, except that once 
we cover 5 we start shortcutting our trajectories inside this set. Namely, whenever we enter some 
vertex v E S after 5 was already covered, we would like to immediately leave S by jumping out over 
one of the edges of its boundary dS. Observe that by doing so we are not losing any information 
that we need later - at this point, we already know all the first-visit edges e v for all the vertices 
in 5. So, the only challenge here is to ensure that the edge through which we leave S is sampled 
consistently with the distribution induced by the random walk. 

More precisely, to implement the above shortcutting strategy, we need to be able, for each vertex 
v E 5, to sample from the distribution {P v {e)} ee gs, where P v {e) is the probability that a random 
walk started at v exits 5 for the first time via the edge e. 

Fortunately, as shown by Kelner and Mqdry [22] and Propp Wl by exploiting the connection 
between random walks, electrical flows and the fast (approximate) Laplacian system solvers 141" . 
BUSS} 26,[23], one can indeed construct a data structure - from now on, called S-shortcut sampler 

- that allows one to quickly sample from such edge-exit distribution. Moreover, as made precise 
in the lemma below, this construction is efficient provided that the size of the boundary of S is 
relatively small compared to the size of its interior. 

Lemma 5.1 Given a subset S C V, we can construct a data structure - called 5-shortcut sampler 

- that allows one to sample, for a given vertex v E 5, an edge e E <95 according to the distribution 
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{P v (e)} e£ gs in O(logm) time. The total expected time spent on constructing and maintaining this 
data structure is 0( | S'! + |.E(S)||<9S|). 

This lemma follows from a simple adaptation of the techniques of Kelner and M^dry (see Lemma 7 
and 9 in [22]) and the observation of Propp {37] (also see Lemma 7.3.2 in [32] 1. We sketch its proof 
in Appendix |A.7[ An interesting feature of this construction is that it never computes the exact 
values of the probabilities P v (e ) (this is a consequence of fast Laplacian system solvers being only 
approximate). Nonetheless, it still allows one to obtain an exact sample from the corresponding 
distribution. 


5.2 Our Sampling Algorithm 


Having introduced the shortcutting technique, we can proceed to describing our sampling algorithm. 
For now, we do it assuming a (technical) condition that the overall effective re sista nce diameter 
7 eff(G') of our graph is relatively small, i.e., 7 e ff(G') = 0(m 2 / 3 ). Later, in Section 5.6 we deal with 
this assumption. 

Recall that in [22], Kelner and Mcj,dry obtain their shortcut transcript of a covering random walk 


in G by, first, finding a simple ball-growing-based partition of the graph into pieces that have small 
graph diameter and, then, running simulation of the walk in G while implementing the shortcutting 
procedure (as described above) for each such piece. As this partition cuts relatively small number 
of edges and - due to their small diameter - the individual cover time of each piece is small too, 
they are able to recover the whole random spanning tree within their claimed running time bounds. 

Given all of that, it might be tempting to follow this approach also in our case. Namely, we could 
simulate a random walk in G until it covers all the vertices in V*, while applying the shortcutting 
procedure to each component of the overlay partition V of our covering family C. In this case, we 
know that all these components have small effective resistance diameter (as C is shattering). So, 
it follows (cf. Section 5.4) that the individual cover time of each is sufficiently small, as needed. 


Furthermore, one might hope that the fact that C is a-bounded implies that the size of boundaries of 
each of these components is also sufficiently small (compared to their size) and thus the construction 
times of corresponding S-shortcut samplers 
Unfortunately, in general, 


(see Lemma 5.1) are acceptable, 
this is not the case. 


It might actually happen that the sizes of 
boundaries of the components of V are prohibitively large and thus we are still hitting here the 


same bottleneck as the original Kelner-Mcglry approach faces - see our discussion in Section 1.2 
(In fact, the graph from Figure [l] is exactly the example showing that.) 

In light of the above, we have to design our shortcutting strategy in a different and much more 
careful way. This will crucially exploit our lack of need of knowing what the first-visit edges are 
for the components of V that are not in V*. Namely, the key realization here is that if all we 
care about is preserving the first-visit information only for the components of V *, there is much 
more flexibility in the way we implement the shortcutting of our walk. In particular, for a given 
component P of V* we do not really need to take S = P and construct the corresponding S'-shortcut 


sampler (cf. Lemma 5.1); it suffices to find some different - larger set S(P) that, on one hand, 
contains P and has small number of boundary edges and, on the other hand, does not overlap any 
other component in V* (but still can overlap components that are components of V \ V*). In fact, 
a similar observation applies also to other components of V (that might not be in V*). 
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To make this precise, for a given component P of the overlay V (that might or might not be in 
V*), let us call a set S C V an escape set for P iff P C S and S does not intersect any component 
of V* other than P (if P is in V*). 

Now, it turns out that once we find for each component P its escape set S(P ) which is not too 
big and has sufficiently small boundary, we can use these sets to generate a shortcut version of our 
random walk. This shortcutting is based on applying a procedure that, whenever our walk enters 
a vertex v belonging to some component P of V, does the following: 

• if P £ V*: 

— if P has already been covered, perform a jump out of 5(P); 

— otherwise: faithfully simulate a single step of the walk; 

• otherwise: perform a jump out of S(P). 

We stop the random walk at the moment we have covered all components Pg?*. Note that this 
procedure always faithfully simulates the walk inside a component from V* until this component 
is fully covered. Also, whenever we perfom a jump over an escape set and land directly inside a 
component of V*, we know what the last traversed edge (through which we entered this component) 
was. So this shortcutting still retains all the information needed to recover the sample from the 
F* -marginal. 

Now, of course, the crucial part is choosing the sets S(P) appropriately, so that the resulting 
running time of our simulation is within the desired bounds. This is the aim of the next section. 

5.3 Choosing Appropriate Escape Sets 

Our goal here is to define the escape sets S(P) for P G V that we use for shortcutting the walk, as 
described above. To this end, let us introduce a new piece of notation. 

For a given 0 < j < £, let us define Vj to be the partition resulting from superimposing on top 
of each other all the sets from subdivisions Co,Ci,... ,Cj. (So, we have that = V, Vo = {V} and 
each Vj is a coarsening of Vj+i-) 

Furthermore, for a component P G V, let us define Vj(P ) to be the component of Vj that 
contains P. Finally, we say that a component R of Vj is modest if it is contained in a set S G Cj. 
The reason why we want to distinguish modest components is because they are small enough for 
our purposes. Namely, if R is modest on level j and R C S G Cj , then \E(R)\ < |P(5)| < m/2 J , 
which might not be true for an arbitrary component. 

Lemma 5.2 Let Pi and P -2 be different components of V with age (Pi) = age(P 2 ) = r. Then they 
belong to different components ofV r : V r (P \) V r (P‘ 2 )- 

Proof: Since Pi P- 2 , for some i* there must exist a set S G C,* such that exactly one of Pi, P 2 

is contained in S. But because of age, neither Pi nor P 2 is contained in any set S G C for i > r. 
Therefore, i* < r. □ 

We infer from the lemma that V r *{P \) and P r *(P 2 ) are disjoint and are both modest whenever 
Pi, P 2 are two different pieces in V*. It follows that V r * (P) satisfies all the required conditions for 
an escape set of P G V *, hence we define S(P) := V r *(P). 
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For the remaining components P E V \ V* we choose S(P) := W(P) - the witness set of P. 
Then we have P C W(P), W(P ) G C a ge(P) and age(P) > r*. So clearly, none of the components in 
P* intersects S'(P). 

Let us now look more closely at the exceptional situation when r* = 0. One may see that in 
this case there is exactly one component in V of age 0, say P. It appears that according to our 
definition, S(P) = V, which makes little sense (we cannot jump out of V). Fortunately, in this 
special case we stop the walk immediately after the single component gets covered. This means 
there is no use for the S(P) set. Note that this issue concerns only the border case r* = 0; if r* > 0, 
every S(P) is a proper subset of V and has some boundary edges through which we may jump out 
of it. 

In Section 5.5 we will see that such a choice of escape sets allows us to prepare all S(P)-shortcut 
samplers in reasonable time. Now we wish to proceed to the runtime analysis of our entire sampling 
procedure. We start with some preparations in the next section. 


5.4 Random Walks Restricted To Subgraphs 

In this section we state a lemma which will be instrumental to our analysis of the random walk 
simulation. It bounds the expected length of a random walk which covers a given subgraph. 

Lemma 5.3 Let G = (V, E ) be an unweighted graph and V' C V. Also let E' C E{V') be any set of 
edges inside V'. Suppose we run a random walk in G from an arbitrary vertex (not necessarily from 
V') and stop it once all vertices from V' have been visited. Then the expected number of traversals 
of edges from the set E' is 

0{\E'\ • 7eff(0 • log n). 

We prove this lemma in Appendix |A.8[ It may be viewed as a generalization of Lemma |2.2| in 
two aspects. First, it lets us look only at traversals of a subset of edges of the graph (for example, 
in the analysis of our algorithm this might be the set of boundary edges of C). Second, we can 
restrict our attention to a subgraph induced by a vertex subset V' and answer the question: what 
is the subgraph cover time , i.e., how many traversals of edges between vertices of V' will there be if 
we decide to stop the walk as soon as V' is covered? 

Note that this improves upon Lemma 6 of [22], not only by considering 7 e fj rather than A, but 
also by dropping the assumption that the number of boundary edges of V' is at most \E(V')\, and 
by letting us only consider a subset of edges. It also subsumes Fact 5 of (22] - We thus see that both 
directions of generalizing the upper bound of 0(mn) for the cover time were known previously, but 
in weaker versions, separately and not using effective resistance. We believe that Lemma |5.3| is a 
robust and useful statement which is of independent interest. 

We now apply the above lemma to analyze our algorithm. 


5.5 Analysis of the Sampling Algorithm 

If we recall our simulation of the random walk that we described in the previous section, there are 
two main ingredients in the running time of that procedure: the expected time needed to prepare 
the data structures for implementing shortcutting as in Section |5.1[ and the expected running time 
of the actual simulation of the shortcut random walk. Below, we analyze both of these separately. 
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Lemma 5.4 The total expected time spent on preparing the structures for shortcutting sets S(P) 
for P E V* is 0(m 4 / 3 ). 


Proof: To initialize our data structures, we need to construct the shortcut samplers for all 

components in Cj for i = 1,... ,£ and modest components of V r *. By Lemma |5.1[ computing and 
maintaining a shortcut sampler for a set S can be done in time 0(|S| + |£1(5)| • \dS\), where dS is 
the set of boundary edges of S. 

and the bound |£ , (S')| < p, the time 


Consider first all sets S E Cj, for some i. By Fact 


3.1 


needed to maintain the corresponding data structures is at most 


J20(\S\ + \E(S)\-\dS\) = 0[n+^|£(S)|-|aS|] = O I n + ^ ^ |55| 


SeCi 


SeCi 


Sec, 


0 \ n+ ffT l 95 l ) = 0 { n+ RT ml/32 *) = 0(m 4/3 ). 


S£Ci 


Then, summing this time over alii = 1,... we obtain that the overall time is 0(m 4 / 3 ) as well. 

Secondly, let us consider all modest components R E V r *. First notice that e E E is a boundary 
edge of V r * iff it is a boundary edge of Cj for some i < r* . Hence, the number of such edges is, by 
Fact |3. H 

^ 6 (m 1 / 3 • 2 *) = Oim 1 / 3 ■ 2 r *) 

i<r* 

as well. Also, for a modest component R there exists an S E C r * with R C S, which implies that 
|Fl(i?)| < |iH(5')| < 2 ™ . As a result, we can see that the previous calculation (that we made for the 
sets S E Cj) holds also in this case. We conclude that the total time spent on maintaining shortcut 
samplers is 0 (m 4 / 3 ), as desired. □ 


Lemma 5.5 The expected time of performing the shortcut random walk simulation is 0{m A / 3 ). 


Proof: Observe that there are two types of steps in our procedure: 

(a) regular, single steps inside some age-r* component, before it is covered, 

(b) jumps over a Cj set (for some j > r*) or over a component in V r *. 

Consider a component P E V* . The expected time spent on steps of type (a) within P is bounded 
by 0(\E(P)\ ■ 7 eff(P) • logn) by Lemma ^3 (used with V' = P and E' = E{P)). Since the effective 
resistance diameter of P is 0(m 1//3 ), this is 0(m 1 / 3 \E(P)\). Thus, the expected total time spent 
on type-(a) steps is 


yP 0{m l / 3 \E{P)\) = 0{m 1 / 3 ^ \E{P)\) = Oim 1 / 3 ■ m) = 0(m 4 / 3 ). 
PeV* Pe"P* 


Let us now concentrate on steps of type (b). Recall that every step of this kind is carried out 
using one query to the corresponding shortcutting structure and thus it takes O(logm) time. Each 
such jump ends on a boundary edge (either of some Cj set or some component in V r *). So we 
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can think that every jump corresponds to a boundary edge traversal in the underlying unshortcut 
random walk. 

We can now bound the time required for type-(b) steps simply by the expected number of times a 
boundary edge is traversed during our simulation. Note that by the time the whole graph is covered 


to bound the time spent on type-(b) steps. We set E' to be the set of all boundary edges, V = V, 
and we recall that 7 e ff(G') = 0(?n 2//3 ) (the small-diameter assumption) and \E'\ = 0(m 2 / 3 ). Thus 
we obtain the desired bound of 0(m 4 / 3 ). 

□ 


for the first time we are certain to be done with our simulation. Therefore we may use Lemma 5.3 


By Lemmas |5.4| and 5.5 the total expected running time of sampling (in the small-diameter 
case) is 0(m 4//3 ). 


5.6 Dealing with the Large-Diameter Case 

Recall that so far we were assuming for simplicity that the effective resistance diameter of G is 
small, i.e., 7 e fi(G') = 0(m 2 / 3 ). We will show how to deal with this assumption by slightly changing 
our algorithm. 

We have a graph G and a collection of disjoint subgraphs V* C V. We intend to sample a 
set according to the _F*-marginal distribution, where F* = Upep* A’(P). We already know how to 
simulate the random walk in G in such a way that we pay only for: 


(a) traversing interior edges of every component until it is covered, 

(b) traversing edges coming from a set E' of size 0(m 2//3 ). 


As seen in the proof of Lemma 5.5, the time complexity of (a) is 0(m 4 / 3 ) irrespective of G, but 


the time complexity of (b) is 0(7 e ff(G')m 2//3 ), which is too much if we are not able to bound the 
diameter of G by 0(m 2//3 ). To circumvent this problem, we use the ball-growing procedure to 
cut G into small-diameter pieces (cf. 2.5). Namely, we obtain a partition V = Ui=i A* satisfying 


A (Fi) = 0(m 2 / 3 ) and \E"\ = 0( m 1 / 3 ), where E" = E \ (Jf=i E(Fi) is the set of boundary edges of 
this partition. Take E' to be the set of all boundary edges (of all subdivisions C). By log°^ m- 
boundedness of C we have \E'\ = 0(m 2 / 3 ). 

In the new algorithm, whenever we cover all interesting vertices (those from V*) within some 
F t , we start shortcutting the walk over the whole F t . This allows us to treat every edge e E E'\ E" 
as part of a small-diameter graph F % and pay for traversing it only until F t is covered. In the next 
subsection we make this intuition formal. 

We now describe the strategy used to simulate the random walk. Suppose we are at a vertex v 
and want to make a step in our simulation. Let Fj be the small-diameter piece that v belongs to. 
Then: 


1. if all vertices in V* n Tj have already been covered: jump over F, in one step, 

2 . otherwise: 

(a) let P be the component in V such that v € P, 

(b) if PeV*: 

i. if P is covered: jump over the S(P) set, 
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ii. else: perform one regular step of the random walk, 

(c) else: jump over S(P). 

Note that case 2. is exactly the same as our simplified strategy from the previous subsections. 


5.7 Analysis of Sampling in the Large-Diameter Case 

As in the small-diameter case, we need to bound two quantities: time spent on preparing the 
shortcutting structures and the expected time of simulation. 


Lemma 5.6 The total expected time spent on preparing the structures for shortcutting sets Ci for 
i = modest components ofV r * and Fi for i = 1 ,..., q is 0(m 4 / 3 ). 


Proof: An additional part compared to the simplified case ar e th e FiS. The expected time needed 

to maintain all the shortcut samplers for them is (use Lemma 5.1): E?=i D(|Pj| + |P(P,)| |dPj|) = 
0(n + ELi lEWWdFi]) = 0(ELi \E(Fi)\\E"\) = d(m\E"\) = 0(m 4 / 3 ). □ 


Lemma 5.7 The expected time of performing the random walk simulation is 0(m 4//3 ). 


Proof: We remarked previously that the expected time spent on covering P’s is bounded by 

0(m 4//3 ). Thus we only need to estimate the expected number of jumps: those from step 1. and 
those from step 2. Like in the small-diameter case, we argue that each jump in 1. corresponds to 
an P 7, -edge traversal. Since \E"\ = 0(m 1//3 ), we may easily bou nd t he total number of P 77 -edge 


traversals during a covering walk of G by 0(m 4 / 3 ) (use Lemma 5.3 with V' = V and 7 e ff(G) < 


A (G) < n ). Similarly, for step 2., each jump ends with an P 7 -edge traversal, so it remains only to 
deal with jumping over P 7 \P 77 -edges. There are 0(m 2//3 ) of them, and each lies inside a component 
F = F. 

Fix F and let E' F = E' n P(P). Our aim is to bound the number of P^-traversals until F is 
covered. This will also be a bound for the number of jumps ending with an P^-edge. To this end, 


we use Lemma 


5.3 


with V' = P and the edge set being E' F . Because 7 e ff(K 7 ) = 0(m 2//3 ), this yields 
m 2 / 3 ). 


a bound of 0(\E F 

Summing this over all P’s we obtain that the total time spent on E' \ P 77 -edge traversals caused 
by jumps is at most Y2i=iO(\E F . \ ■ m 2//3 ) = 0(|P / \ E"\ ■ m 2 / 3 ) = 0(m 4 / 3 ). This completes our 
argument. □ 

The two lemmas above let us conclude that the sampling can be performed in total expected 
time 0(m 4 / 3 ). 
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A Appendix 


A.l Proof of Lemma 12.21 


We prove the lower bound only; the upper bound is a special case of Lemma 5.3 


Let us denote by H ( u , v) the expected number of steps before a random walk starting at u visits 
v. We have n{u, v) = H{u,v) + H(v,u) and hence rna x(H(u,v), H(v,u)) > \n{u,v). Now one just 
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needs to perform the simple estimation that 


Cov(G) > max H(u,v) > - max k(u,v ) = max mR e e(u,v) = m'y e s(G), 
u,vEV 2 u,v£V u,v£V 

where the first inequality follows as covering a graph G starting from u requires hitting all the other 
vertices first, and the first equality is by definition of effective resistance - see Section |ih2j 

A.2 Proof of Lemma 12.41 

We employ here an adaptation of the partitioning procedure from |22] that is itself based on the 
ball-growing technique of Leighton and Rao [29| • 

For a subset of vertices V' C V let us define the set of neighbors N(V') to be V' U {w E 
V : (v,w) E E for some v E V'}. Also recall that dV' denotes the set of edges with exactly one 
endpoint in V'. 

We start with the original G, i = 1 and perform the following procedure until G is empty: 

• choose any vertex v E G and set D := {u}; 

• while \dD\ > 4>\E(D)\ do: set D := N(D)\ 

• add D as a new Vi\ 

• remove D and all incident edges from G; 

• i := i + 1. 

Obviously Pi,...,14 is a partition of V. One can also easily prove that the condition (2) in 
Definition |2.3| is satisfied. It remains to show that each piece has a small diameter, i.e., establish 
condition (1). 

To this end, note that the radius of D at the moment it is added as a new piece is bounded by 
the number of the while-loop executions. We also see that each execution increases \E(D)\ at least 
by a factor of l+</>. This means that after s steps, \E{D)\ > (1+</>) s . Since (1 + 0) lo s( m+1 )/^ > m+1 
for 4> < g, we deduce that the radius of D is bounded by log (” t+1 ) arK j thus the diameter of D is at 

most 21 °gb n + 1 ) _ Jt is not hard to see that the presented algorithm can be implemented to run in 
0 (m + n) time. 

A.3 Proof of Lemma 13.41 

It is not hard to see that given F' and F*, the construction of the corresponding (F*, P^-conditioned 
graph G' can be done in 0(m) time. Also, note that as edges of F* (and thus also edges of F') 
are always interior edges of components of the overlay V(C) of the covering family C, removing or 
contracting them translates to removing or contracting the interior edges of the sets in C. So, doing 
that (which can again be performed in 0(m ) time, as each edge can be in at most £+1 = O(logm) 
different sets) makes the resulting covering family C" remain valid for G' and still a-bounded. (As 
mentioned before, the value of m used in the definition of the covering family and a-boundedness 
always refers to the number of edges of the original graph and thus remains unchanged here.) 

Now, observe that the only difference between the overlay V(C) of C and the overlay V{C") of C" 
is that each minimum-age component P of V(C) has its interior edges removed and/or contracted 
in V(C"). This means that each such P corresponds to a minimum-age component P' in V(C") 
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that is a collection of vertices with empty interior (the only edges incident to these vertices are the 
boundary edges of P, which now also form the boundary of P'). 

So, if the age r* of C (and thus that of C") was equal to £, then all the components of V(C) 
must have been minimum-age and thus all of them have empty interiors now. Given the way the 
overlay components are defined, this means that each edge remaining in G' is part of the boundary 
of some set in C" . Thus, in that case, we simply take C equal to C " and the lemma follows. 

We can, therefore, restrict ourselves to the case when r* is actually strictly smaller than l. In 
this case, let us take C to be a covering family in G' that arises from adding, for each vertex of each 
minimum-age component P' of V(C"), a singleton set containing only that vertex to the upper-most 
subdivision C” of C" . (Note that as r* < l and P' is minimum-age, it does not intersect any of the 
sets in C” and thus adding these singleton sets to C'[ is valid.) 

Observe that performing this operation results in each component P' being split into single¬ 
vertex sets in the overlay V(C') of C and each of these sets being of age £. So, as r* < £, this 
increases the age of all the minimum-age components of V(C") and thus the age of C is at least 
r* + 1 now. 

In light of the above, all that remains to be done is to argue that C is still a-bounded. To this 
end, note that adding our singleton sets does not introduce any new boundary edges. This is so 
since, as we already noted above, the only edges incident to vertices of minimum-age components 
P' of V(C ") were boundary edges. Furthermore, as these singleton sets are added to C'[, it cannot 
affect these edges’ status as i-based boundary edges. That is, if such an edge was an i-based 
boundary edge in C" then it remains such in C . Thus, as the a-boundedness property boils down 
to bounding the number of i-based boundary edges for each 0 < i < £, it must be preserved by our 
operation and thus the covering family C is indeed a-bounded, as desired. 


A.4 Proof of Lemma 14.21 


The cut K separates G into connected components (possibly more than two) - some of them 
intersect P\, some intersect P2 and some intersect neither (but none can intersect both). Let 
A] be the union of those that intersect Pi and A 2 be the union of the others; we thus obtain a 
decomposition G = A\ U A2 of G where Pi C A\, P2 C A 2 and all edges between A\ and A 2 come 
from the cut K, i.e., E(Ai, A2) C K. 

Our plan will be to choose the set S as a subset of W (P) of at most half its size and thus set i to 
be larger than age(P). (Recall that W(P) denotes the age witness of P - see Section|3.4|) Because 


no set in C l with i > age(P) intersects P, this will make it easier for us to guarantee condition <§• 
So let us define U\ := W(P) 0 A±, U 2 := W(P) 0 A 2 and assume wlog that \E(Ui)\ < \E(U 2 )\- 
Since U\ and U 2 are disjoint and JJ\ U U 2 = W(P), we get that 


|£(t/.)l < \\E(W(P))\ < (7) 

where we used the bound Q. 

The set U\ is our candidate for the set S. We will now modify it appropriately (by shrinking) 
in order to satisfy all the required conditions. 

Recall that we want to enforce < |P(S')| < p- Let i be the right level for U\, that is, the 
smallest i such that < |P(<S')|. By ([7]) we know that i > age(P). We would ideally like to 
return U\ as a candidate for insertion into Ci. However, U\ may intersect some sets in Ci, which 
violates our condition ([3]). To deal with this problem, note that since i > age(P), no set in Ci 
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intersects P (or Pi), so we can remove the intersection of U\ with all the sets in p. and we still 
have that Pi D Pi. 

Of course, now - after removing this intersection with p - the size |P(Pi)| of the interior of Pi 
might have decreased considerably. Pi still contains Pi, which is a large piece, so it cannot be too 
small, i.e., it has size at least m 2 / 3 , but it might no longer hold that |P(Pi)| > If this indeed 
happens, then we appropriately increase i, and repeat. 

The resulting set-refining routine works as follows. We start with i = 1 and then: 

(a) if |P(Pi)| < 2 ?ft> increment i and repeat; 

(b) otherwise, if the intersection of all sets in p with U\ is nonempty, remove it from Pi and repeat 
(starting from step (a)); 

(c) otherwise, we can stop and output the desired S := U\ and i. 

Let us now verify the satisfaction of all conditions. Conditions (§, © and © are clear given 
the design of our set-refining routine. To establish condition ©, notice that i < t because even 
after all the removals, Pi still contains Pi, hence p > |P(Pi)| > |P(Pi)| > ?b 2 / 3 = p. Also 
i > age(P) > 0. 

Regarding condition ©> observe that all boundary edges of Pi are either boundary edges of 
A\ (and they come from the cut K), or arise as a result of intersecting A\ with the age witness 
W(P) 6 C a ge(P) and taking a set difference with some number of sets from subdivisions Cj with 
j < i. In the latter case, they are already boundary edges of C on the respective levels age(P) < i 
and j < i. 


A. 5 Proof of Lemma 14.41 


Note that by Theorem 2.1, the construction of our low-dimensional embedding 1Z can be carried 
out in O(rn) time. 

Let us now consider the runtime of the Cut (P) procedure for one component P E V(C). If the 


ball-growing does not fail, then, using Lemma 2.4, we can bound this running time by 0(|P| + 


|P(P)|). On the other hand, if ball-growing fails, then this running time is equal to 0(m ) plus the 
time needed to execute the recursive calls to Cut(P') and Cut(P"). 

Therefore, to bound the total running time Cut(P), we model it using a binary tree whose leaves 
correspond to ball-growing succeeding and internal nodes describe handling ball-growing failures 
and subsequent recursive calls. By summing the contributions of all the nodes of this tree, we can 
conclude that the total time spent on all calls to the Cut(P) procedure is at most 

0(m-c)+ ^ 0(\P\ + \E(P)\) = 0(m ■ c +m) 

P&V{C) 

where c is the total number of cuts computed during handling ball-growing failures (the sum is 
taken over the final overlay P(C)). 

No w, ev ery time we compute a cut, we end up adding a new set to p with 0 < i < i. By 
Lemma 


4.3 


this can happen at most 'Yl!i= * 2* = 0(2^) = 0(m 1//3 ) times. Hence we get a runtime 


of 0{m ■ c + m ) = 0(t7i 4 / 3 ). 
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A.6 Proof of Lemma 14.51 

We need to prove that, for every i, the number of Abased boundary edges is at most {a + a*)m}^ 2* 
with a* = log°^ m. To this end, we will show that the number of new Abased boundary edges 
(ones that were not present in the original C ) is 0(m 1//3 2 l ). 

Case i < £: every new Abased boundary edge is the boundary edge of a new set S E C 1 (added 
in the cutting case). Therefore we only need to bound the number of Abased boundary edges 
introduced by all new sets 5 G C;. If we can prove that adding a new set S E C\ introduces at 
most 0(m 1//3 ) new Abased boundary edges, we will be done by Lemma 


4.3 


But condition ([5]) of 

Lemma |4.2| means exactly that all new Abased boundary edges of such a set S must come from 
the cut K of the phase when S was added. And recall that Lemma |4 .1 1 applied to the components 
Pi and P 2 , of low diameter m 1 / 3 and high effective resistance of P(?n 1//3 ) between them, gave us 
exactly the bound m 1//3 for the cardinality of the cut K, as computed in ([5]). 

Case i = t: every new Abased boundary edge is a boundary edge of a new small piece (added 
when ball-growing succeeds). During the entire partitioning procedure, all the calls to the ball¬ 
growing primitive that introduce new Abased boundary edges concern disjoint subgraphs H of G. 
Each such call yields a set of cut edges of size O ( ^^\E(H)\ ), so all the calls result in at most 


O 


(mPa) = 0(m 2 / 3 ) = 0(m 1//3 • 2 e ) cut edges. 


A.7 Proof of Lemma 15.11 

Let us first note that if the interior E(S) of S has more than one connected component, then 
the problem decomposes into completely independent subproblems corresponding to each of these 
connected components. So, from now on, we can assume that E(S) forms a connected graph and, 
in particular, |S[ < |P(S')| + 1. 

Fix e = (u, u') E dS (with u E S and v! ^ S) and suppose we want to calculate the probabilities 
P v (e) for all v E S. Let us construct an auxiliary graph G' as follows. The vertex set is S'U{'U / ,tt*}, 
where u* is a dummy vertex. We include in G' all edges from E(S) and the edge e; we also include 
all boundary edges of S (those going from S to V \ S) except e, but we change their endpoint that 
originally belonged to V \ S to be u* now. 

Now, it is not hard to see that P v (e ) is equal to the probability q v that the random walk in 
this auxiliary graph starting at v will visit v! before hitting u*. The latter probability turns out, 
however, to be closely related to the electrical structure of the graph G'. 

More precisely, let us treat G' as an electrical network, where every edge has unit resistance. It 
is known (see Chapter 4 in [3D]) that when we impose a voltage of 1 and 0 at v! and u* respectively, 
the voltage at v becomes exactly q v (and thus P v (e)). 

Furthermore, this probability q v can be computed by solving a Laplacian linear system of 
the form L'x = y , where L' is the discrete Laplacian of G'. In fact, in this way we can get such 
probabilities q v for all possible starting vertices v E S simultaneously. For details, refer to Lemma 9 
in [22]. 

Therefore, we see that computing all the probabilities P v (e) boils down to solving a Laplacian 
linear system L'x = y for all the graphs G' corresponding to each edge e E dS. 

As shown first by Spielman and Teng [42| (see also [25, S3 E3]), solving such a system is possible 
in time that is nearly-linear with respect to the number of edges in G'. However, this method does 
not provide us with exact values of P„(e)’s - it outputs only approximate values with prespecified 
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precision e. So, in our case, for a given e > 0, the total computation takes 0(|-E(S')||<9,S'| log 1/e) 
time and computes the values of all P v (e) s up to an additive error of e. 

Now, suppose for a moment that all these computed probabilities were exact. Then, for a fixed 
v E S, it is easy to build a simple array structure which allows us to sample edges from dS with 
the desired distribution. It boils down to constructing first an array of prefix sums of size |<9S'| and 
then, whenever a sample is needed, performing a simple binary search in this array, which takes 
time O(logm). (See Lemma 7 in [22] for details.) 

Finally, the way of coping with the precision issue is based on an idea due to Propp EH- Here, 
we only explain the idea on a simpler example and refer the reader to Lemma 7.3.2 in [32] for a 
complete description. 

In this example, assume we want to take a sample from a p-biased Bernoulli distribution, i.e., 
sample a 0-1 random variable that is 0 with probability p and 1 with probability 1 — p. If we knew 
the value of p, we could perform this sampling by simply choosing a value r E [0,1] uniformly at 
random and then outputting 0 if r < p and 1 otherwise. 

Assume now that instead of the exact value of p we only have access to an algorithm which, 
given s, outputs p with precision ±e in time proportional to T • log 1/e, for some T > 0. To perform 
our sampling in this case, we again choose a uniformly random r E [0,1] and then compare it to 
the current approximation p' E [p — e,p + e] of the value of p, for some e > 0. 

Observe that as long as r < p' — e or r > p' + e, we can simply output 0 and 1 respectively, 
as we are certain that r < p or, respectively, r > p. Now, in the remaining case of r E \p' — 
£, p' + e], we simply keep halving the value of e (and computing the corresponding refinement of 
our approximation to the value of p) until we are able to conclude that either r < p or r > p (in 
which case we can output an appropriate value). 

Clearly, this procedure terminates (with probability 1) and outputs the correct sample. It 
thus remains to analyze the expected time spent on recomputing p'. To this end, note that the 
probability that during our sampling the required precision will be less than 2 ~ k is at most 2 ~ k+l . 
So, the expected time spent on recomputing p' is bounded by 

E Jc • T ■ M1/2- 1 ) = T • £ = 0(T). 

k> 1 Z k> 1 

Similarly, one can easily show that if we needed s samples (instead of only one) then this expected 
time would become 0{T • logs). 

A.8 Proof of Lemma 15.31 

Let us define the hitting time H(u, v ) between two vertices u, v E V to be the expected number of 
steps before a random walk starting at u visits v. We have k{u, v ) = H(u, v) + H(y, u ). 

We break the proof down into a sequence of lemmas. 

Lemma A.l (cf. Lemma 2.8 of [30]) Let V' C V be a subset of vertices, s E V - a starting 
vertex, and let M = max„ g y/ H(s,v) be the maximum hitting time between s and a vertex in V'. 
Suppose we run a random walk from s until more than half of the vertices from V' are visited. The 
expected length of this walk is at most 2M. 
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Proof: We assume first that \V'\ = 2k + 1 is odd. Let a v be the time when vertex v is first 

visited. Then the length of the walk, (3, is the (k + l)-th largest of the a v . Hence 

(k + l)/3 < ^2 ol v 

v&V' 


ana 

1 1 ‘2k 4 - 1 

m < ^ E £ rrr ■ i 1 "' ■ M = < 2M - 

vev 1 

If \V’\ = 2 k is even, we proceed analogously and obtain fifM < 2 M at the end. 


□ 


Lemma A.2 Let V' C V. Suppose we run a random walk in G from a vertex v G V' and stop it 
on the first visit to v after all vertices from V' have been visited. Then the expected number of steps 
is 


0(\E\ ■ 7eff(H') • logn). 


Proof: The proof is similar to that of Theorem 2.7 of m- We divide the walk into phases. The 

first one ends when more than half of vertices from V' have been visited. By Lemma A.l (with 
s = v), we know that its expected length is at most 


2ma y:H(v,u) < 2ma xk(v,u) = A\E\ maxi? e ff(u, u) = OLE^^iV')) 
u&V' u&V' u&V' 


(recall that we defined R e g(v,u ) to be K z\e\ )• 

The second phase ends when at least half of the remaining vertices of V' have been covered, 
and the lemma applied to that set of vertices (and the starting vertex s being the vertex where 
the first phase ended) again bounds the expected length of the second phase by the same quantity. 
(It is important here that s 6 V\ as the first phase must have ended with a visit to a vertex of 
V'.) We proceed in this fashion until V 1 is covered. By linearity of expectation, the expected 
length of the walk up to this point is 0{\E\^f e g(V')) times the number of phases, which is clearly 
log \ V'\ = O(logn). (Note that the number of phases is a fixed quantity here.) Finally, the 
remaining (expected) time needed to return to v from the last visited vertex of V' is at most 
rna x ue v h\u,v) = 0(\E\j eS (V')). □ 


Now, we will argue that every edge in G has exactly the same contribution to the expected 

Intuitively, this is because the (infinite) stationary 


length of the random walk of Lemma A.2 


random walk on G (where, in the long run, every edge has the same contribution) can be viewed as 
an (infinite) sequence of samples from the trajectory of our random walk (plus an initial fragment 
of bounded expected length). Thus, if any edge contributed more than a 1/|£'|-fraction to the 
length of our walk, then this discrepancy would also emerge in the stationary walk, which is a 
contradiction. 

We proceed to formalizing the above intuition. To this end, let us state the following lemma 
from renewal theory. It appears as Proposition 7.3 in 


Lemma A.3 Let X\, X 2 , ■ ■ ■ be an infinite sequence of nonnegative i.i.d. random variables with 
finite expectation, i.e., X\ > 0, E[Xi] < 00 . Let Ri,R, 2 ,... be another such sequence. (Each 
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sequence is independent, but R{ may depend on A* for every i.) For any t > 0, set R(t) = 
Ri + R 2 + • • • + R m , where N(t ) is the maximum integer such that X\ + X 2 + • • • + X N ^ < t. 
Then 

E[i2(i)] E [R^ 

lim - = —7—r. 

t —>00 t E[Ai] 

We are now ready to analyze the number of traversals over a given edge e in our random walk. 


Lemma A.4 Let V 1 C V and v E V'. Also, fix any edge e E E. Consider again the random walk 
from the statement of Lemma A.2. The expected number of times e will be traversed during this 
walk is 0(7eff(V ') • logn). 


Proof: Consider an infinite random walk starting at v. We will divide this walk into phases, each 

of which starts at v and ends on the first visit to v after all vertices from V' have been visited. In 
this way we get an i.i.d. sequence of copies of the random walk of Lemma A.2 For every i let A, 


be the length of the z-th phase, and let R{ be the number of times the edge e was traversed during 
this phase. By Lemma |A.3| we obtain that 


the expected number of traversals of edge e during one phase 
the expected length of one phase 


mi\ = lim mm 

E[Ai] t-Kx) t 


where R(t ) is the number of traversals of e during all phases which have finished by time t. We 
need to show that this quantity is equal to 1/|A|. 

Indeed, if we define R'(t) to be the number of traversals of e up to time t , then it is well- 
known that ^lirri ^ (which is the stationary probability of traversing e). Furthermore, 

the difference between R'(t ) and R(t) is just the number of traversals of e during the current phase 
at time t, and we can upper-bound E [R'(t) — i?(t)] by E[Ai]. Hence, we have that 


E[i?'(f)] E [R(t)} _ E[R'(t)}-E[R'(t)~ R{t)} > E[#(t)] - E[A t ] 
t ~ t t ~ t 

Taking a limit of t —> 00 of both sides concludes the proof. 


□ 


Now Lemma |5.3| follows easily. Namely, let us consider a random walk started at an arbitrary 
vertex (possibly outside of V) and let no be the first vertex in V' on that walk. (No edges from E' 
will be traversed before vq is reached, as E' C E(V').) Applying Lemma A. 4 with v = no to each 
edge e £ £', we obtain that the total number of traversals of edges in E' in this random walk until 
the set V' is covered is 0(\E'\ ■ 7 e ff(P / ) • logn), as we wanted to show. 
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